Combined audio and visual streams analysis for video sequence segmentation
نویسندگان
چکیده
We present a new approach to video sequence segmentation into individual shots. Unlike previous approaches, our technique segments the video sequence by combining two streams of information extracted from the visual track with audio track segmentation information. The visual streams of information are computed from the coarse data in a 3-D wavelet decomposition of the video track. They consist of (i) information derived from temporal edges detected along the time evolution of the intensity of each pixel in temporally sub-sampled spatially ltered coarse frames, and (ii) information derived from the coarse spatio-temporal evolution of intra-frame edges in the spatially ltered coarse frames. Our approach is particularly matched to progressively transmitted video.
منابع مشابه
مقایسه اثر بخشی ریلکسیشن پیشرونده، ترکیب ریلکسیشن با تحریکات ریتمیک نوری و ترکیب ریلکسیشن با تحریکات ریتمیک صوتی بر ضربان قلب و فشار خون دانشجویان
Background and purpose: The aim of this research was to compare the efficacy of relaxation, and relaxation combined by periodic visual stimulation and periodic audio stimulation on blood pressure and heart rate of university students. Materials and methods: This experimental study was conducted in 36 psychology students in Allameh Tabatabaee University. The students were randomly selected and...
متن کاملThe effects of segmentation and redundancy methods on cognitive load and vocabulary learning and comprehension of English lessons in a multimedia learning environment
The present study was conducted with the aim of the effects of segmentation and redundancy methods on cognitive load and vocabulary learning and comprehension of English lessons in a multimedia learning environment.The purpose of this study is an applied research and a real experimental study. The statistical population of the present study includes all people aged 14 to 16 who are enrolled in ...
متن کاملAudio/Visual Independent Components
This paper presents a methodology for extracting meaningful audio/visual features from video streams. We propose a statistical method that does not distinguish between the auditory and visual data, but one that operates on a fused data set. By doing so we discover audio/visual features that correspond to events depicted in the stream. Using these features, we can obtain a segmentation of the in...
متن کاملOn-line knowledge- and rule-based video classification system for video indexing and dissemination
Current information and communication technologies provide the infrastructure to transport bits anywhere, but do not indicate how to easily and precisely access and/or route information at the semantic level. To facilitate intelligent access to the rich multimedia data over the Internet, we develop an on-line knowledgeand rule-based video classification system that supports automatic ‘‘indexing...
متن کاملMulti-modal audio-visual event recognition for football analysis
The recognition of events within multi-modal data is a challenging problem. In this paper we focus on the recognition of events by using both audio and video data. We investigate the use of data fusion techniques in order to recognise these sequences within the framework of Hidden Markov Models (HMM) used to model audio and video data sequences. Specifically we look at the recognition of play a...
متن کامل